Multithresholding of mixed-type documents
Authors
Abstract
Mixed-type documents include text, drawings and graphics regions. A technique that reduces the number of gray levels according to the type of each document region would benefit many document applications, such as storage, transmission and recognition. To solve this problem, this paper proposes a new method, called the document multithresholding technique. The method is based on a page layout analysis (PLA) technique and on a neural-network multilevel threshold-selection approach. The proposed technique is applicable to any mixed-type document and achieves document multithresholding by taking advantage of the types of the document blocks. Thus, in the final document, different block types are stored with appropriate and limited numbers of gray-level values. The proposed method includes two main stages. First, a PLA technique is applied, which classifies the document blocks into text, line-drawing and graphics regions. In the second stage, a new neural-network multithresholding technique is applied to each of the document blocks. In text and line-drawing blocks, only one threshold is determined, whereas in graphics blocks the optimal number of thresholds is first determined. The performance of the method has been extensively tested on a variety of documents. Several examples illustrate the strength and effectiveness of the proposed methodology. © 2000 Elsevier Science Ltd. All rights reserved.
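The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the block layout is assumed to be already known (no PLA is performed), Otsu's threshold stands in for the single-threshold case, and a 1-D k-means quantizer with a fixed number of levels stands in for the neural-network multithresholding of graphics blocks. The function names, the block tuple layout and the `graphics_levels` parameter are all hypothetical.

```python
import numpy as np

def otsu_threshold(block):
    """Gray level that maximizes between-class variance (Otsu's method)."""
    hist = np.bincount(block.ravel(), minlength=256).astype(float)
    prob = hist / hist.sum()
    levels = np.arange(256)
    w0 = np.cumsum(prob)            # weight of the class with gray level <= t
    mu = np.cumsum(prob * levels)   # cumulative first moment up to t
    w1 = 1.0 - w0
    valid = (w0 > 0) & (w1 > 0)     # skip thresholds that leave a class empty
    between = np.zeros(256)
    between[valid] = (mu[-1] * w0[valid] - mu[valid]) ** 2 / (w0[valid] * w1[valid])
    return int(np.argmax(between))

def quantize_levels(block, k, iters=20):
    """Reduce a block to at most k gray levels with 1-D k-means (a stand-in)."""
    values = block.astype(float).ravel()
    centers = np.linspace(values.min(), values.max(), k)
    for _ in range(iters):
        labels = np.argmin(np.abs(values[:, None] - centers[None, :]), axis=1)
        for j in range(k):
            members = values[labels == j]
            if members.size:
                centers[j] = members.mean()
    labels = np.argmin(np.abs(values[:, None] - centers[None, :]), axis=1)
    return np.round(centers[labels]).astype(np.uint8).reshape(block.shape)

def multithreshold(image, blocks, graphics_levels=4):
    """Apply a per-block gray-level reduction chosen by block type.

    `blocks` is a list of (top, left, bottom, right, kind) tuples, where kind
    is "text", "line-drawing" or "graphics" (assumed given, not detected).
    """
    out = image.copy()
    for top, left, bottom, right, kind in blocks:
        region = image[top:bottom, left:right]
        if kind in ("text", "line-drawing"):
            # One threshold: the block becomes bilevel.
            t = otsu_threshold(region)
            out[top:bottom, left:right] = np.where(region > t, 255, 0)
        else:
            # Several levels: the count is fixed here, not optimized.
            out[top:bottom, left:right] = quantize_levels(region, graphics_levels)
    return out
```

With an 8-bit grayscale array `image`, a call such as `multithreshold(image, [(0, 0, 100, 400, "text"), (100, 0, 300, 400, "graphics")])` would return an array of the same shape in which the text block holds only two gray values and the graphics block holds at most four. Unlike the method described in the abstract, this sketch does not estimate the optimal number of thresholds for graphics blocks.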
Similar resources
Color image segmentation using histogram multithresholding and fusion
A novel method for multiband image segmentation has been proposed. The method is based on segmentation of subsets of bands using multithresholding followed by the fusion of the resulting segmentation "channels". For color images the band subsets are chosen as the RB, RG and BG pairs, whose two-dimensional histograms are processed via a peak-picking algorithm to effect multithresholding. The seg...
Pii: S0262-8856(99)00015-3
One of the most frequently used methods in image processing is thresholding. This can be a highly efficient means of aiding the interpretation of images. A new technique suitable for segmenting both gray-level and color images is presented in this paper. The proposed approach is a multithresholding technique implemented by a Principal Component Analyzer (PCA) and a Kohonen Self-Organized Featur...
the identify the dimensions and components of effective teachers in primary schools in Iran: mixed method
This study aims to identify the dimensions and components of effective teachers in primary schools in Iran. Owing to its use of the Q-type method, the research takes a mixed-method (qualitative-quantitative) approach. The study population included all members of the Faculty of Psychology and Educational Sciences of the University of Tehran, of whom 20 were selected by a combination of purposeful sampling...
License Plate Recognition System using Neural Networks and Multithresholding Technique
License plate recognition is a fully automated real time technique that has been widely used for identification, theft control and security validation of vehicles. For recognition and extraction of desired regions of the number plate of the vehicle, different algorithms are used. An image processing technology based on license plate recognition (LPR) that is being used to identify vehicles, usi...
Comparison results on the preconditioned mixed-type splitting iterative method for M-matrix linear systems
Consider the linear system Ax=b where the coefficient matrix A is an M-matrix. In the present work, it is proved that the rate of convergence of the Gauss-Seidel method is faster than the mixed-type splitting and AOR (SOR) iterative methods for solving M-matrix linear systems. Furthermore, we improve the rate of convergence of the mixed-type splitting iterative method by applying a preconditio...